angry bird
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
Xu, Xinrun, Bu, Pi, Wang, Ye, Karlsson, Börje F., Wang, Ziming, Song, Tengtao, Zhu, Qi, Song, Jun, Ding, Zhiming, Zheng, Bo
Although Vision Language Models (VLMs) exhibit strong perceptual abilities and impressive visual reasoning, they struggle with attention to detail and precise action planning in complex, dynamic environments, leading to subpar performance. Real-world tasks typically require complex interactions, advanced spatial reasoning, long-term planning, and continuous strategy refinement, usually necessitating understanding the physics rules of the target scenario. However, evaluating these capabilities in real-world scenarios is often prohibitively expensive. To bridge this gap, we introduce DeepPHY, a novel benchmark framework designed to systematically evaluate VLMs' understanding and reasoning about fundamental physical principles through a series of challenging simulated environments. DeepPHY integrates multiple physical reasoning environments of varying difficulty levels and incorporates fine-grained evaluation metrics. Our evaluation finds that even state-of-the-art VLMs struggle to translate descriptive physical knowledge into precise, predictive control.
Label-Free Subjective Player Experience Modelling via Let's Play Videos
Goel, Dave, Mahmoudi-Nejad, Athar, Guzdial, Matthew
Player Experience Modelling (PEM) is the study of AI techniques applied to modelling a player's experience within a video game. PEM development can be labour-intensive, requiring expert hand-authoring or specialized data collection. In this work, we propose a novel PEM development approach, approximating player experience from gameplay video. We evaluate this approach predicting affect in the game Angry Birds via a human subject study. We validate that our PEM can strongly correlate with self-reported and sensor measures of affect, demonstrating the potential of this approach.
- North America > Canada > Alberta (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- Leisure & Entertainment > Games > Computer Games (1.00)
- Health & Medicine > Therapeutic Area (1.00)
Utilizing Generative Adversarial Networks for Stable Structure Generation in Angry Birds
Abraham, Frederic, Stephenson, Matthew
This paper investigates the suitability of using Generative Adversarial Networks (GANs) to generate stable structures for the physics-based puzzle game Angry Birds. While previous applications of GANs for level generation have been mostly limited to tile-based representations, this paper explores their suitability for creating stable structures made from multiple smaller blocks. This includes a detailed encoding/decoding process for converting between Angry Birds level descriptions and a suitable grid-based representation, as well as utilizing state-of-the-art GAN architectures and training methods to produce new structure designs. Our results show that GANs can be successfully applied to generate a varied range of complex and stable Angry Birds structures.
Physics-Based Task Generation through Causal Sequence of Physical Interactions
Gamage, Chathura, Pinto, Vimukthini, Stephenson, Matthew, Renz, Jochen
Performing tasks in a physical environment is a crucial yet challenging problem for AI systems operating in the real world. Physics simulation-based tasks are often employed to facilitate research that addresses this challenge. In this paper, first, we present a systematic approach for defining a physical scenario using a causal sequence of physical interactions between objects. Then, we propose a methodology for generating tasks in a physics-simulating environment using these defined scenarios as inputs. Our approach enables a better understanding of the granular mechanics required for solving physics-based tasks, thereby facilitating accurate evaluation of AI systems' physical reasoning capabilities. We demonstrate our proposed task generation methodology using the physics-based puzzle game Angry Birds and evaluate the generated tasks using a range of metrics, including physical stability, solvability using intended physical interactions, and accidental solvability using unintended solutions. We believe that the tasks generated using our proposed methodology can facilitate a nuanced evaluation of physical reasoning agents, thus paving the way for the development of agents for more sophisticated real-world applications.
- Oceania > Australia > South Australia > Adelaide (0.04)
- Oceania > Australia > Australian Capital Territory > Canberra (0.04)
- Workflow (0.93)
- Research Report > Promising Solution (0.46)
Heuristic Search For Physics-Based Problems: Angry Birds in PDDL+
Piotrowski, Wiktor, Sher, Yoni, Grover, Sachin, Stern, Roni, Mohan, Shiwali
This paper studies how a domain-independent planner and combinatorial search can be employed to play Angry Birds, a well established AI challenge problem. To model the game, we use PDDL+, a planning language for mixed discrete/continuous domains that supports durative processes and exogenous events. The paper describes the model and identifies key design decisions that reduce the problem complexity. In addition, we propose several domain-specific enhancements including heuristics and a search technique similar to preferred operators. Together, they alleviate the complexity of combinatorial search. We evaluate our approach by comparing its performance with dedicated domain-specific solvers on a range of Angry Birds levels. The results show that our performance is on par with these domain-specific approaches in most levels, even without using our domain-specific search enhancements.
- Europe > Czechia > Prague (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Asia > Middle East > Israel > Southern District > Beer-Sheva (0.04)
Children addicted to video games are attacking their PARENTS, experts warn
Parents shouldn't take away consoles from children who show signs of gaming addiction because it could lead to physical violence, a psychologist has warned. Professor Henrietta Bowden-Jones, head of the National Centre for Gaming Disorders, says it can become a police matter when kids lose access to gaming. The centre – which along with the World Health Organization recognises gaming addiction as a disorder – is seeing teens who are gaming up to 14 hours a day. Children are hooked on Call of Duty, Fortnite, FIFA, Angry Birds, War Zone and Minecraft, but no game is necessarily more addictive than any other. According to the centre's founder, it has dealt with 745 patients since it opened in October 2019, including 327 last year.
- Leisure & Entertainment > Games > Computer Games (1.00)
- Health & Medicine (1.00)
Phy-Q: A Benchmark for Physical Reasoning
Xue, Cheng, Pinto, Vimukthini, Gamage, Chathura, Nikonova, Ekaterina, Zhang, Peng, Renz, Jochen
Humans are well-versed in reasoning about the behaviors of physical objects when choosing actions to accomplish tasks, while it remains a major challenge for AI. To facilitate research addressing this problem, we propose a new benchmark that requires an agent to reason about physical scenarios and take an action accordingly. Inspired by the physical knowledge acquired in infancy and the capabilities required for robots to operate in real-world environments, we identify 15 essential physical scenarios. For each scenario, we create a wide variety of distinct task templates, and we ensure all the task templates within the same scenario can be solved by using one specific physical rule. By having such a design, we evaluate two distinct levels of generalization, namely the local generalization and the broad generalization. We conduct an extensive evaluation with human players, learning agents with varying input types and architectures, and heuristic agents with different strategies. The benchmark gives a Phy-Q (physical reasoning quotient) score that reflects the physical reasoning ability of the agents. Our evaluation shows that 1) all agents fail to reach human performance, and 2) learning agents, even with good local generalization ability, struggle to learn the underlying physical reasoning rules and fail to generalize broadly. We encourage the development of intelligent agents with broad generalization abilities in physical domains.
Playing Angry Birds with a Domain-Independent PDDL+ Planner
Piotrowski, Wiktor, Stern, Roni, Klenk, Matthew, Perez, Alexandre, Mohan, Shiwali, de Kleer, Johan, Le, Jacob
This demo paper presents the first system for playing the popular Angry Birds game using a domain-independent planner. Our system models Angry Birds levels using PDDL+, a planning language for mixed discrete/continuous domains. It uses a domain-independent PDDL+ planner to generate plans and executes them. In this demo paper, we present the system's PDDL+ model for this domain, identify key design decisions that reduce the problem complexity, and compare the performance of our system to model-specific methods for this domain. The results show that our system's performance is on par with other domain-specific systems for Angry Birds, suggesting the applicability of domain-independent planning to this benchmark AI challenge.
- North America > United States > California > Santa Clara County > Palo Alto (0.05)
- Asia > Middle East > Israel > Southern District > Beer-Sheva (0.05)
The Difficulty of Novelty Detection in Open-World Physical Domains: An Application to Angry Birds
Pinto, Vimukthini, Xue, Cheng, Gamage, Chathura Nagoda, Renz, Jochen
Detecting and responding to novel situations in open-world environments is a key capability of human cognition. Current artificial intelligence (AI) researchers strive to develop systems that can perform in open-world environments. Novelty detection is an important ability of such AI systems. In an open-world, novelties appear in various forms and the difficulty to detect them varies. Therefore, to accurately evaluate the detection capability of AI systems, it is necessary to investigate the difficulty to detect novelties. In this paper, we propose a qualitative physics-based method to quantify the difficulty of novelty detection focusing on open-world physical domains. We apply our method in a popular physics simulation game, Angry Birds. We conduct an experiment with human players with different novelties in Angry Birds to validate our method. Results indicate that the calculated difficulty values are in line with the detection difficulty of the human players.
Deceptive Level Generation for Angry Birds
Gamage, Chathura, Stephenson, Matthew, Pinto, Vimukthini, Renz, Jochen
The Angry Birds AI competition has been held over many years to encourage the development of AI agents that can play Angry Birds game levels better than human players. Many different agents with various approaches have been employed over the competition's lifetime to solve this task. Even though the performance of these agents has increased significantly over the past few years, they still show major drawbacks in playing deceptive levels. This is because most of the current agents try to identify the best next shot rather than planning an effective sequence of shots. In order to encourage advancements in such agents, we present an automated methodology to generate deceptive game levels for Angry Birds. Even though there are many existing content generators for Angry Birds, they do not focus on generating deceptive levels. In this paper, we propose a procedure to generate deceptive levels for six deception categories that can fool the state-of-the-art Angry Birds playing AI agents. Our results show that generated deceptive levels exhibit similar characteristics of human-created deceptive levels. Additionally, we define metrics to measure the stability, solvability, and degree of deception of the generated levels.
- Europe > Netherlands > Limburg > Maastricht (0.04)
- Oceania > Australia > Australian Capital Territory > Canberra (0.04)
- North America > United States > New York > New York County > New York City (0.04)